"Beirut" meaning in All languages combined

See Beirut on Wiktionary

Proper name [Almanî]

  1. Beyrût
    Sense id: ku-Beirut-de-name-sPhr1Nl2 Categories (other): Bajar bi almanî
The following are not (yet) sense-disambiguated

Proper name [Danmarkî]

  1. Beyrût
    Sense id: ku-Beirut-da-name-sPhr1Nl2 Categories (other): Bajar bi danmarkî
The following are not (yet) sense-disambiguated

Proper name [Endonezyayî]

  1. Beyrût
    Sense id: ku-Beirut-id-name-sPhr1Nl2 Categories (other): Bajar bi endonezyayî
The following are not (yet) sense-disambiguated

Proper name [Estonî]

  1. Beyrût
    Sense id: ku-Beirut-et-name-sPhr1Nl2 Categories (other): Bajar bi estonî
The following are not (yet) sense-disambiguated
Categories (other): Estonî, Serenav bi estonî

Noun [Fînî]

  1. Beyrût, Bêrût
    Sense id: ku-Beirut-fi-noun-K1dtBx-i
The following are not (yet) sense-disambiguated
Categories (other): Fînî, Navdêr bi fînî

Proper name [Katalanî]

  1. Beyrût
    Sense id: ku-Beirut-ca-name-sPhr1Nl2 Categories (other): Bajar bi katalanî
The following are not (yet) sense-disambiguated
Categories (other): Katalanî, Serenav bi katalanî

Proper name [Malezî]

  1. Beyrût
    Sense id: ku-Beirut-ms-name-sPhr1Nl2 Categories (other): Bajar bi malezî
The following are not (yet) sense-disambiguated
Categories (other): Malezî, Serenav bi malezî

Proper name [Norweciya bokmålî]

  1. Beyrût
    Sense id: ku-Beirut-nb-name-sPhr1Nl2 Categories (other): Bajar bi norweciya bokmålî
The following are not (yet) sense-disambiguated

Proper name [Norweciya nînorskî]

  1. Beyrût
    Sense id: ku-Beirut-nn-name-sPhr1Nl2 Categories (other): Bajar bi norweciya nînorskî
The following are not (yet) sense-disambiguated

Proper name [Romanyayî]

  1. Beyrût
    Sense id: ku-Beirut-ro-name-sPhr1Nl2 Categories (other): Bajar bi romanyayî
The following are not (yet) sense-disambiguated
Categories (other): Romanyayî, Serenav bi romanyayî

Proper name [Skotî]

  1. Beyrût
    Sense id: ku-Beirut-sco-name-sPhr1Nl2 Categories (other): Bajar bi skotî
The following are not (yet) sense-disambiguated
Categories (other): Serenav bi skotî, Skotî

Proper name [Spanî]

  1. Beyrût
    Sense id: ku-Beirut-es-name-sPhr1Nl2 Categories (other): Bajar bi spanî
The following are not (yet) sense-disambiguated
Categories (other): Serenav bi spanî, Spanî

Proper name [Swêdî]

  1. Beyrût
    Sense id: ku-Beirut-sv-name-sPhr1Nl2 Categories (other): Bajar bi swêdî
The following are not (yet) sense-disambiguated

Proper name [Îdoyî]

  1. Beyrût
    Sense id: ku-Beirut-io-name-sPhr1Nl2 Categories (other): Bajar bi îdoyî
The following are not (yet) sense-disambiguated
Categories (other): Serenav bi îdoyî, Îdoyî

Noun [Îngilîzî]

Audio: LL-Q1860 (eng)-Vealhurl-Beirut.wav
  1. Beyrût, Bêrût
    Sense id: ku-Beirut-en-noun-K1dtBx-i
The following are not (yet) sense-disambiguated

Proper name [Îtalî]

  1. Beyrût
    Sense id: ku-Beirut-it-name-sPhr1Nl2 Categories (other): Bajar bi îtalî
The following are not (yet) sense-disambiguated
Categories (other): Serenav bi îtalî, Îtalî
{
  "categories": [
    {
      "kind": "other",
      "name": "Almanî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Serenav bi almanî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Serenavên nêtar bi almanî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Almanî",
  "lang_code": "de",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Bajar bi almanî",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Beyrût"
      ],
      "id": "ku-Beirut-de-name-sPhr1Nl2"
    }
  ],
  "tags": [
    "gender-neutral"
  ],
  "word": "Beirut"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Danmarkî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Serenav bi danmarkî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Serenavên nêtar bi danmarkî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Danmarkî",
  "lang_code": "da",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Bajar bi danmarkî",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Beyrût"
      ],
      "id": "ku-Beirut-da-name-sPhr1Nl2"
    }
  ],
  "tags": [
    "gender-neutral"
  ],
  "word": "Beirut"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Estonî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Serenav bi estonî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Estonî",
  "lang_code": "et",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Bajar bi estonî",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Beyrût"
      ],
      "id": "ku-Beirut-et-name-sPhr1Nl2"
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Fînî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Navdêr bi fînî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Fînî",
  "lang_code": "fi",
  "pos": "noun",
  "pos_title": "Navdêr",
  "senses": [
    {
      "glosses": [
        "Beyrût, Bêrût"
      ],
      "id": "ku-Beirut-fi-noun-K1dtBx-i"
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Endonezyayî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Serenav bi endonezyayî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Endonezyayî",
  "lang_code": "id",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Bajar bi endonezyayî",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Beyrût"
      ],
      "id": "ku-Beirut-id-name-sPhr1Nl2"
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Serenav bi îdoyî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Îdoyî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Îdoyî",
  "lang_code": "io",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Bajar bi îdoyî",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Beyrût"
      ],
      "id": "ku-Beirut-io-name-sPhr1Nl2"
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Deng bi îngilîzî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Navdêr bi îngilîzî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Îngilîzî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Îngilîzî",
  "lang_code": "en",
  "pos": "noun",
  "pos_title": "Navdêr",
  "senses": [
    {
      "glosses": [
        "Beyrût, Bêrût"
      ],
      "id": "ku-Beirut-en-noun-K1dtBx-i"
    }
  ],
  "sounds": [
    {
      "audio": "LL-Q1860 (eng)-Vealhurl-Beirut.wav",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/9/91/LL-Q1860_(eng)-Vealhurl-Beirut.wav/LL-Q1860_(eng)-Vealhurl-Beirut.wav.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/9/91/LL-Q1860_(eng)-Vealhurl-Beirut.wav/LL-Q1860_(eng)-Vealhurl-Beirut.wav.ogg",
      "raw_tags": [
        "Başûrê Îngilistanê, QY"
      ],
      "wav_url": "https://commons.wikimedia.org/wiki/Special:FilePath/LL-Q1860 (eng)-Vealhurl-Beirut.wav"
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Serenav bi îtalî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Îtalî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Îtalî",
  "lang_code": "it",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Bajar bi îtalî",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Beyrût"
      ],
      "id": "ku-Beirut-it-name-sPhr1Nl2"
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Katalanî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Serenav bi katalanî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Katalanî",
  "lang_code": "ca",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Bajar bi katalanî",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Beyrût"
      ],
      "id": "ku-Beirut-ca-name-sPhr1Nl2"
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Malezî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Serenav bi malezî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Malezî",
  "lang_code": "ms",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Bajar bi malezî",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Beyrût"
      ],
      "id": "ku-Beirut-ms-name-sPhr1Nl2"
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Norweciya bokmålî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Serenav bi norweciya bokmålî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Norweciya bokmålî",
  "lang_code": "nb",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Bajar bi norweciya bokmålî",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Beyrût"
      ],
      "id": "ku-Beirut-nb-name-sPhr1Nl2"
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Norweciya nînorskî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Serenav bi norweciya nînorskî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Norweciya nînorskî",
  "lang_code": "nn",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Bajar bi norweciya nînorskî",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Beyrût"
      ],
      "id": "ku-Beirut-nn-name-sPhr1Nl2"
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Romanyayî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Serenav bi romanyayî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Romanyayî",
  "lang_code": "ro",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Bajar bi romanyayî",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Beyrût"
      ],
      "id": "ku-Beirut-ro-name-sPhr1Nl2"
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Serenav bi skotî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Skotî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Skotî",
  "lang_code": "sco",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Bajar bi skotî",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Beyrût"
      ],
      "id": "ku-Beirut-sco-name-sPhr1Nl2"
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Serenav bi spanî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Spanî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Spanî",
  "lang_code": "es",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Bajar bi spanî",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Beyrût"
      ],
      "id": "ku-Beirut-es-name-sPhr1Nl2"
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    {
      "kind": "other",
      "name": "Serenav bi swêdî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Serenavên nêtar bi swêdî",
      "parents": [],
      "source": "w"
    },
    {
      "kind": "other",
      "name": "Swêdî",
      "parents": [],
      "source": "w"
    }
  ],
  "lang": "Swêdî",
  "lang_code": "sv",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Bajar bi swêdî",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "Beyrût"
      ],
      "id": "ku-Beirut-sv-name-sPhr1Nl2"
    }
  ],
  "tags": [
    "gender-neutral"
  ],
  "word": "Beirut"
}
{
  "categories": [
    "Almanî",
    "Serenav bi almanî",
    "Serenavên nêtar bi almanî"
  ],
  "lang": "Almanî",
  "lang_code": "de",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        "Bajar bi almanî"
      ],
      "glosses": [
        "Beyrût"
      ]
    }
  ],
  "tags": [
    "gender-neutral"
  ],
  "word": "Beirut"
}

{
  "categories": [
    "Danmarkî",
    "Serenav bi danmarkî",
    "Serenavên nêtar bi danmarkî"
  ],
  "lang": "Danmarkî",
  "lang_code": "da",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        "Bajar bi danmarkî"
      ],
      "glosses": [
        "Beyrût"
      ]
    }
  ],
  "tags": [
    "gender-neutral"
  ],
  "word": "Beirut"
}

{
  "categories": [
    "Endonezyayî",
    "Serenav bi endonezyayî"
  ],
  "lang": "Endonezyayî",
  "lang_code": "id",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        "Bajar bi endonezyayî"
      ],
      "glosses": [
        "Beyrût"
      ]
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    "Estonî",
    "Serenav bi estonî"
  ],
  "lang": "Estonî",
  "lang_code": "et",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        "Bajar bi estonî"
      ],
      "glosses": [
        "Beyrût"
      ]
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    "Fînî",
    "Navdêr bi fînî"
  ],
  "lang": "Fînî",
  "lang_code": "fi",
  "pos": "noun",
  "pos_title": "Navdêr",
  "senses": [
    {
      "glosses": [
        "Beyrût, Bêrût"
      ]
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    "Katalanî",
    "Serenav bi katalanî"
  ],
  "lang": "Katalanî",
  "lang_code": "ca",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        "Bajar bi katalanî"
      ],
      "glosses": [
        "Beyrût"
      ]
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    "Malezî",
    "Serenav bi malezî"
  ],
  "lang": "Malezî",
  "lang_code": "ms",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        "Bajar bi malezî"
      ],
      "glosses": [
        "Beyrût"
      ]
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    "Norweciya bokmålî",
    "Serenav bi norweciya bokmålî"
  ],
  "lang": "Norweciya bokmålî",
  "lang_code": "nb",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        "Bajar bi norweciya bokmålî"
      ],
      "glosses": [
        "Beyrût"
      ]
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    "Norweciya nînorskî",
    "Serenav bi norweciya nînorskî"
  ],
  "lang": "Norweciya nînorskî",
  "lang_code": "nn",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        "Bajar bi norweciya nînorskî"
      ],
      "glosses": [
        "Beyrût"
      ]
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    "Romanyayî",
    "Serenav bi romanyayî"
  ],
  "lang": "Romanyayî",
  "lang_code": "ro",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        "Bajar bi romanyayî"
      ],
      "glosses": [
        "Beyrût"
      ]
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    "Serenav bi skotî",
    "Skotî"
  ],
  "lang": "Skotî",
  "lang_code": "sco",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        "Bajar bi skotî"
      ],
      "glosses": [
        "Beyrût"
      ]
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    "Serenav bi spanî",
    "Spanî"
  ],
  "lang": "Spanî",
  "lang_code": "es",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        "Bajar bi spanî"
      ],
      "glosses": [
        "Beyrût"
      ]
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    "Serenav bi swêdî",
    "Serenavên nêtar bi swêdî",
    "Swêdî"
  ],
  "lang": "Swêdî",
  "lang_code": "sv",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        "Bajar bi swêdî"
      ],
      "glosses": [
        "Beyrût"
      ]
    }
  ],
  "tags": [
    "gender-neutral"
  ],
  "word": "Beirut"
}

{
  "categories": [
    "Serenav bi îdoyî",
    "Îdoyî"
  ],
  "lang": "Îdoyî",
  "lang_code": "io",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        "Bajar bi îdoyî"
      ],
      "glosses": [
        "Beyrût"
      ]
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    "Deng bi îngilîzî",
    "Navdêr bi îngilîzî",
    "Îngilîzî"
  ],
  "lang": "Îngilîzî",
  "lang_code": "en",
  "pos": "noun",
  "pos_title": "Navdêr",
  "senses": [
    {
      "glosses": [
        "Beyrût, Bêrût"
      ]
    }
  ],
  "sounds": [
    {
      "audio": "LL-Q1860 (eng)-Vealhurl-Beirut.wav",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/9/91/LL-Q1860_(eng)-Vealhurl-Beirut.wav/LL-Q1860_(eng)-Vealhurl-Beirut.wav.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/9/91/LL-Q1860_(eng)-Vealhurl-Beirut.wav/LL-Q1860_(eng)-Vealhurl-Beirut.wav.ogg",
      "raw_tags": [
        "Başûrê Îngilistanê, QY"
      ],
      "wav_url": "https://commons.wikimedia.org/wiki/Special:FilePath/LL-Q1860 (eng)-Vealhurl-Beirut.wav"
    }
  ],
  "word": "Beirut"
}

{
  "categories": [
    "Serenav bi îtalî",
    "Îtalî"
  ],
  "lang": "Îtalî",
  "lang_code": "it",
  "pos": "name",
  "pos_title": "Serenav",
  "senses": [
    {
      "categories": [
        "Bajar bi îtalî"
      ],
      "glosses": [
        "Beyrût"
      ]
    }
  ],
  "word": "Beirut"
}

Download raw JSONL data for Beirut meaning in All languages combined (4.2kB)

{
  "called_from": "parserfns/156",
  "msg": "#tag creating non-allowed tag <phonos> - omitted",
  "path": [
    "Beirut",
    "Template:deng",
    "#tag",
    "#tag"
  ],
  "section": "Îngilîzî",
  "subsection": "Bilêvkirin",
  "title": "Beirut",
  "trace": ""
}

This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2025-05-02 from the kuwiktionary dump dated 2025-04-20 using wiktextract (bb9bcd7 and e876143). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.